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29 ABSTRACT 

30 The emergence of Middle East respiratory syndrome showed once again that 

31 coronaviruses (CoVs) in animals are potential source for epidemics in humans. To 

32 explore the diversity of deltacoronaviruses in animals in the Middle East, we tested 

33 fecal samples from 1,356 mammals and birds in Dubai. Four novel deltacoronaviruses 

34 were detected from eight birds of four species by RT- PCR: FalCoV UAE-HKU27 

35 from a falcon, HouCoV UAE-HKU28 from a houbara bustard, PiCoV UAE-HKU29 

36 from a pigeon and QuaCoV UAE-HKU30 from five quails. Complete genome 

37 sequencing showed that FalCoV UAE-HKU27, HouCoV UAE-HKU28 and PiCoV 

38 UAE-HKU29 belong to the same CoV species, suggesting recent interspecies 

39 transmission between falcons and their preys, houbara bustards and pigeons possibly 

40 along the food chain. Western blot detected specific anti-FalCoV UAE-HKU27 

41 antibodies in 33 (75%) of 44 falcon serum samples, supporting genuine infection in 

42 falcons after virus acquisition. QuaCoV UAE-HKU30 belongs to the same CoV 

43 species as PorCoV HKU15 and SpCoV HKU17 discovered previously from swine 

44 and tree sparrows respectively, supporting avian-to-swine transmission. 

45 Recombination involving the spike protein is common among deltacoronaviruses, 

46 which may facilitate cross-species transmission. FalCoV UAE-HKU27, HouCoV 

47 UAE-HKU28 and PiCoV UAE-HKU29 were originated from recombination between 

48 WECoV HKU16 and MRCoV HKU18; QuaCoV UAE-HKU30 from recombination 

49 between PorCoV HKU15/SpCoV HKU17 and MunCoV HKU13, and PorCoV 

50 HKU15 from recombination between SpCoV HKU17 and BuCoV HKU11. Birds in 

51 the Middle East are hosts for diverse deltacoronaviruses with potential for interspecies 

52 transmission. 
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54 IMPORTANCE 

55 During an attempt to explore the diversity of deltacoronaviruses among mammals and 

56 birds in Dubai, four novel deltacoronaviruses were detected in fecal samples from 

57 eight birds of four different species: FalCoV UAE-HKU27 from a falcon, HouCoV 

58 UAE-HKU28 from a houbara bustard, PiCoV UAE-EIKU29 from a pigeon and 

59 QuaCoV UAE-HKU30 from five quails. Genome analysis revealed evidence of recent 

60 interspecies transmission between falcons and their preys, houbara bustards and 

61 pigeons possibly along the food chain, as well as avian-to-swine transmission. 

62 Recombination, which is known to occur frequently in some coronaviruses, was also 

63 common among these deltacoronaviruses and predominantly occurred at the spike 

64 region. Such recombination, involving the receptor binding protein, may contribute to 

65 the emergence of new viruses capable of infecting new hosts. Birds in the Middle East 

66 are hosts for diverse deltacoronaviruses with potential for interspecies transmission. 
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68 INTRODUCTION 

69 Coronaviruses (CoVs) infect humans and a wide variety of animals, causing 

70 respiratory, enteric, hepatic, and neurological diseases of varying severity. Based on 

71 genotypic and serological characterization, Co Vs were traditionally divided into three 

72 distinct groups (1-3). In 2008, the Coronavirus Study Group of the International 

73 Committee for Taxonomy of Viruses (ICTV) has replaced the traditional groups 1, 2 

74 and 3 CoVs with three genera, Alphacoronavirus, Betacoronavirus and 

75 Gammacoronavirus (4). As a result of the unique mechanism of viral replication, 

76 CoVs have a high frequency of recombination (1). Their tendency for recombination 

77 and the inherently high mutation rates typical of RNA viruses may allow rapid 

78 adaptation to new hosts and ecological niches (5-9). 

79 The severe acute respiratory syndrome (SARS) epidemic and the discovery of 

80 SARS coronavirus (SARS-CoV)-like viruses from palm civets in China have boosted 

81 interests in discovery of novel CoVs in both humans and animals (10-15). In 2004, a 

82 novel human CoV (HCoV) of the genus Alphacoronavirus, human coronavirus NL63 

83 (HCoV NL63), was reported independently by two groups in 2004 (16, 17). In 2005, 

84 we also described the discovery of another novel HCoV, human coronavirus HKU1 

85 (HCoV HKU1), under the genus Betacoronavirus (18-20). As for animal CoVs, we 

86 and others have described the discovery of SARS-CoV-like viruses in horseshoe bats 

87 in Hong Kong and other provinces of China (21, 22). Based on these findings, we 

88 expanded molecular surveillance studies to examine the diversity of CoVs in bats of 

89 southern China, during which at least nine other novel CoVs were discovered, 

90 including two novel lineages in Betacoronavirus, lineage C and D (23-25). Other 

91 novel CoVs in bats and other animals have also been discovered by various research 
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92 groups, which has broadened our knowledge on the diversity and evolution of CoVs 

93 (7,26-29). 

94 Birds are the reservoir of major emerging viruses, most notably, avian 

95 influenza viruses (30). Due to their flocking behavior and abilities to fly over long 

96 distances, birds have the potential to disseminate emerging viruses efficiently among 

97 themselves and to other animals and human. Yet, the number of known Co Vs in birds 

98 is relatively small as compared to bats. In 2009, we described the discovery of three 

99 novel CoVs in three families of birds, named bulbul coronavirus HKU11 (BuCoV 

100 HKU11), thrush coronavirus HKU12 (ThCoV HKU12) and munia coronavirus 

101 HKU13 (MunCoV HKU13) (31). These three CoVs formed a unique group of CoV, 

102 which were subsequently classified as a novel genus of CoV, Deltacoronavirus (4). 

103 Recently, we have further discovered seven additional deltacoronaviruses: one from 

104 pigs, named porcine coronavirus HKU15 (PorCoV HKU15), and six from birds, 

105 named white-eye coronavirus HKU16 (WECoV HKU16), sparrow coronavirus 

106 HKU17 (SpCoV HKU17), magpie robin coronavirus HKU18 (MRCoV HKU18), 

107 night-heron coronavirus HKU19 (NHCoV HKU19), wigeon coronavirus HKU20 

108 (WiCoV HKU20) and common-moorhen coronavirus HKU21 (CMCoV HKU21) (32). 

109 Subsequently, PorCoV HKU15 was found to be present widely in pigs and Asia and 

110 North America, and has been associated with fatal outbreaks in pig farms (33-42). 

111 The findings supported that deltacoronaviruses have the potential for avian-to- 

112 mammalian transmission and emergence in mammals. We hypothesize that there are 

113 other previously unrecognized deltacoronaviruses present in geographical locations 

114 other than Hong Kong, which may also have the potential for emergence. To test this 

115 hypothesis, we carried out a molecular epidemiology study in 1,356 mammals and 

116 birds in Dubai, The United Arab Emirates in the Middle East. Based on the results of 
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117 comparative genome and phylogenetic analyses, we propose four novel CoVs in 

118 Deltacoronavirus. The results of sero-epidemiological studies were also discussed. 
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119 RESULTS 

120 Animal surveillance and identification of four novel deltacoronaviruses. A total 

121 of 1,356 fecal samples from 1,164 mammals and 192 birds were obtained (Table 1). 

122 Reverse transcription-polymerase chain reaction (RT-PCR) for a 440-bp fragment in 

123 the RNA-dependent-RNA polymerase (RdRp) genes of CoVs was positive in 

124 specimens from eight birds of four species. Sequencing results suggested the presence 

125 of four novel deltacoronaviruses: the first (falcon CoV [FalCoV] UAE-HKU27]) from 

126 one falcon (family Falconidae), the second (houbara CoV [HouCoV] UAE-HKU28) 

127 from one houbara bustard (family Otididae), the third (pigeon CoV [PiCoV] UAE- 

128 HKU29) from one pigeon (family Columbidae) and the fourth (quail CoV [QuaCoV] 

129 UAE-E1KU30) from five quails (family Phasianidae) (Table 1, Fig. 1). None of the 

130 1,164 mammals tested were positive. 

131 Genome organization and coding potentials of the four novel 

132 deltacoronaviruses. The complete genome sequences of one strain each of FalCoV 

133 UAE-HKU27, HouCoV UAE-HKU28 and PiCoV UAE-HKU29 and two strains of 

134 QuaCoV UAE-HKU30 were obtained by assembly of the sequences of RT-PCR 

135 products from the RNA extracted from the corresponding individual specimens. 

136 The genome sizes of the four novel CoVs ranged from 25,871 (QuaCoV UAE- 

137 HKU30 strain 1101F) to 26,162 bases (FalCoV UAE-HKU27 strain 988F) and their 

138 G + C contents ranged from 0.39 to 0.42 (Table 2). Their genome organizations are 

139 typical of CoVs, with the gene order 5'-replicase ORFlab, spike (S), envelope (E), 

140 membrane (M), nucleocapsid (N)-3' (Fig. 2 and Table 3). Both 5' and 3' ends contain 

141 short untranslated regions. The replicase ORFlab occupies 18.363 to 18.678 kb of the 

142 genomes (Table 3). This open reading frame (ORF) encodes a number of putative 

143 proteins, including nsp3 (which contains the putative papain-like protease [PF p10 ]), 
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144 nsp5 (putative chymotrypsin-like protease [3CL p10 ]), nspl2 (putative RdRp), nspl3 

145 (putative helicase) and other proteins of unknown functions. Overall, the cleavage 

146 sites for the non-structural proteins in ORFlab of FalCoV UAE-HKU27, HouCoV 

147 UAE-HKU28, PiCoV UAE-HKU29 and QuaCoV UAE-HKU30 were similar to other 

148 deltacoronaviruses, except for nsp3/nsp4 and nspl5/16 in FalCoV UAE-HKU27, 

149 HouCoV UAE-HKU28 and PiCoV UAE-HKU29 (Table 4). In fact, the amino acids 

150 downstream to the putative cleavage site at nsp3/nsp4 are quite variable across 

151 different deltacoronaviruses (Table 4). As for the amino acids downstream to the 

152 putative cleavage site at nspl5/nspl6, they are AL instead of SL (Table 4). 

153 The four novel CoVs display a similar genome organization and differ only in 

154 the number of ORFs downstream of N (Fig. 2). Their transcription regulatory 

155 sequences (TRSs) conform to the consensus motif 5'-ACACCA-3' (Table 3), which is 

156 unique to deltacoronaviruses. Interestingly, similar to other deltacoronaviruses, the 

157 perfect TRS of S in the genomes of the four novel Co Vs were separated from the 

158 corresponding AUG by a large number (140) of bases (Table 3). This is in contrast to 

159 the relatively small number of bases between the TRS for S and the corresponding 

160 AUG (range: 0 base in HCoV NL63, Rhinolophus bat coronavirus HKU2 [Rh- 

161 BatCoV HKU2], HCoV FIKU1, bovine coronavirus [BCoV], human coronavirus 

162 OC43 [HCoV OC43], mouse hepatitis virus [MHV], porcine hemagglutinating 

163 encephalomyelitis virus, SARS-CoV and SARS-related Rhinolophus bat coronavirus 

164 HKU3 [SARSr-Rh-batCoV HKU3] to 52 bases in infectious bronchitis virus [IBV]) 

165 in alphacoronaviruses, betacoronaviruses and gammacoronaviruses. Similar to other 

166 deltacoronaviruses, the genomes of the four novel Co Vs contain putative PL pro , which 

167 are homologous to PL2 pro of alphacoronaviruses and betacoronavirus subgroup A and 

168 PL p10 of betacoronavirus subgroups B, C and D and gammacoronaviruses. Besides, 
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169 one ORF (NS6) is found between M and N of the genomes of the four novel CoVs. In 

170 all the four novel CoVs, one ORF (NS7a) is present overlapping with N. For FalCoV 

171 UAE-HKU27, HouCoV UAE-HKU28 and PiCoV UAE-HKU29, three ORFs (NS7b, 

172 7c and 7d), and for QuaCoV UAE-HKU30, two ORFs (NS7b and 7c) are present 

173 downstream of N. BLAST search revealed no amino acid similarities between these 

174 putative non-structural proteins and other known proteins and no functional domain 

175 was identified by PFAM and InterProScan, except that NS7a of FalCoV UAE- 

176 HKU27, HouCoV UAE-HKU28 and PiCoV UAE-HKU29 were homologous to NS7a 

177 of WECoV HKU16, NS7b homologous to NS7b of CMCoV HKU21 and NS7c 

178 homologous to NS7a of ThCoV HKU12, and that NS7a, NS7b and NS7c of QuaCoV 

179 UAE-HKU30 were homologous to NS7a, NS7b and NS7c of SpCoV HKU17, 

180 respectively. Transmembrane helices, predicted by TMHMM, were found only in 

181 NS7c of FalCoV UAE-HKU27, HouCoV UAE-HKU28 and PiCoV UAE-HKU29 at 

182 positions 4 to 23 and 30 to 52 among all the putative accessory proteins downstream 

183 to the N genes of FalCoV UAE-HKU27, HouCoV UAE-HKU28, PiCoV UAE- 

184 HKU29 and QuaCoV UAE-HKU30. Each of the genomes of FalCoV UAE-HKU27, 

185 HouCoV UAE-HKU28, PiCoV UAE-HKU29 and QuaCoV UAE-HKU30 contains a 

186 stem-loop II motif (s2m) (residues 25,924 to 25,965, 25,924 to 25,965, 25,932 to 

187 25,973 and 26,649 to 26,690, respectively), a conserved RNA element downstream of 

188 N and upstream of the polyA tail, similar to those in IBV, TCoV, SARSr-Rh-BatCoV, 

189 SARS-CoV as well as other deltacoronaviruses. 

190 Comparison of the amino acid identities of the seven conserved replicase 

191 domains for species demarcation (ADRP, nsp5 [3CL pro ], nspl2 [RdRp], nspl3 [HelJ, 

192 nspl4 [ExoN], nspl5 [NendoU] and nspl6 [O-MT]) (4) among the four novel CoVs 

193 is shown in Tables 5A and 5B. In all the seven domains, the amino acid sequences of 
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194 FalCoV UAE-HKU27, HouCoV UAE-HKU28 and PiCoV UAE-HKU29 showed 

195 more than 90% identity among each other, but their overall amino acid sequences 

196 showed only 84.4% identity to those of WECoV HKU16, indicating that these three 

197 CoVs should be subspecies of a novel CoV species (Table 5A). As for QuaCoV UAE- 

198 HKU30, its overall amino acid sequences showed more than 90% identity to those of 

199 PorCoV HKU15 and SpCoV HKU17, indicating that QuaCoV UAE-HKU30, 

200 PorCoV E1KU15 and SpCoV HKU17 should be subspecies of the same CoV species 

201 (Table 5B). 

202 Phylogenetic analyses. The phylogenetic trees reconstructed using the 

203 nucleotide sequences of 3CL pro , RdRp, Hel, S and N of the four novel Co Vs and other 

204 deltacoronaviruses are shown in Fig. 3 and the corresponding pairwise amino acid 

205 identities shown in Table 2. In all five trees, FalCoV UAE-HKU27, HouCoV UAE- 

206 HKU28 and PiCoV UAE-HKU29 were clustered together (Fig. 3). In the 3CF pro , 

207 RdRp, Hel and N trees, FalCoV UAE-HKU27, HouCoV UAE-HKU28 and PiCoV 

208 UAE-HKU29 were clustered with WECoV HKU16, whereas QuaCoV UAE-HKU30 

209 was clustered with PorCoV HKU15 and SpCoV HKU17 (Fig. 3). However, in the S 

210 tree, FalCoV UAE-HKU27, HouCoV UAE-HKU28 and PiCoV UAE-HKU29 were 

211 most closely related to MRCoV HKU18, whereas QuaCoV UAE-HKU30 was most 

212 closely related to MunCoV HKU13 (Fig. 3). 

213 Recombination analysis. For the FalCoV UAE-HKU27, HouCoV UAE- 

214 HKU28 and PiCoV UAE-HKU29 cluster and QuaCoV UAE-HKU30, bootscan 

215 analysis showed possible recombination sites in the S gene (positions 20,300-24,300 

216 for the FalCoV UAE-HKU27, HouCoV UAE-HKU28 and PiCoV UAE-HKU29 

217 cluster [Fig. 4A] and positions 19,900-23,300 for QuaCoV UAE-HKU30 [Fig. 4B]). 
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218 Estimation of divergence dates. Using the Bayesian Skyline under a relaxed 

219 clock model with an uncorrelated lognormal distribution, the mean evolutionary rate 

220 of CoVs was estimated as 1.027 x 10 4 nucleotide substitutions per site per year for 

221 the RdRp gene. Molecular clock analysis using the RdRp gene showed that the mean 

222 time to the most recent common ancestor (tMRCA) of Deltacoronavirus was 

223 estimated to be April 82 (95% highest posterior density [HPD], 822 BC to 1824), that 

224 of HouCoV UAE-HKU28/PiCoV UAE-HKU29 to be March 2011 (95% HPD, 

225 October 2009 to February 2013), that of FalCoV UAE-HKU27/HouCoV UAE- 

226 HKU28/PiCoV UAE-HKU29 to be June 1981 (95% HPD, January 1964 to June 2010) 

227 and that of QuaCoV UAE-HKU30 to be February 1954 (95% HPD, October 1921 to 

228 September 2007) (Fig. 5). 

229 Western blot analysis. Prominent immunoreactive bands were visible for 33 

230 (75%) of 44 falcon serum samples. The band size observed (42 kDa) was consistent 

231 with the expected size of 41.1 kDa for the full-length His 6 -tagged recombinant N 

232 protein (Fig. 6). 
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234 DISCUSSION 

235 Similar to birds in Hong Kong, birds in the Middle East are also hosts for a diversity 

236 of deltacoronaviruses. In 2009, we reported the discovery of three novel avian Co Vs 

237 that were phylogenetically distinct from infectious bronchitis virus (31). These three 

238 CoVs were found in fecal samples of bulbuls, thrushes and munias in Hong Kong (31). 

239 In 2011, the ICTV approved the classification of these three avian CoVs as a novel 

240 genus Deltacoronavirus in the Coronaviridae family (4). In 2012, in a large 

241 epidemiological study, we discovered seven additional deltacoronaviruses (32). Six of 

242 them were from fecal samples of Japanese white eyes, tree sparrows, original magpie 

243 robins, black-crowned night herons, Eurasian wigeons, and common moorhens 

244 respectively and one from fecal samples of pigs in Hong Kong (32). In the last few 

245 years, PorCoV HKU15 (now officially named Coronavirus HKU15 ) has been widely 

246 detected in fecal samples of pigs in Canada, China, Laos, Mexico, South Korea, 

247 Thailand, Vietnam and the USA (33-41). Recently, we have also found PorCoV 

248 HKU15 in respiratory samples of pigs which may have implications on the possible 

249 routes and sites of infections (42). In this study, four additional novel 

250 deltacoronaviruses were detected in the fecal samples of falcons, houbara bustards, 

251 pigeons and quails in Dubai. Similar to the other deltacoronaviruses, FalCoV UAE- 

252 HKU27, HouCoV UAE-HKU28, PiCoV UAE-HKU29 and QuaCoV UAE-HKU30 

253 also have large genome sizes of 25,871 to 26,162 bases, owing to the presence of 

254 multiple ORFs downstream to the N gene. Continuous surveillance studies on birds in 

255 the Middle East and other regions will help better understand the viral and host 

256 diversity of deltacoronaviruses, and their potential for emergence in mammals. 

257 Recent interspecies jumping events were observed in the deltacoronaviruses 

258 from falcons, houbara bustards and pigeons, which were likely a result of predator- 
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259 and-prey relationship along the food chain. According to the ICTV definition for 

260 demarcation of CoV species, where Co Vs that share an overall amino acid identity of 

261 more than 90% in their seven conserved replicase domains (ADRP, nsp5 [3CL pi °], 

262 nspl2 [RdRp], nspl3 [Hel], nspl4 [ExoN], nspl5 [NendoU] and nspl6 [O-MT]) 

263 should be regarded as the same species, FalCoV UAE-HKU27, HouCoV UAE- 

264 HKU28 and PiCoV UAE-HKU29 should be subspecies of a novel CoV species. 

265 Notably, the S proteins of FalCoV UAE-HKU27, HouCoV UAE-HKU28 and PiCoV 

266 UAE-HKU29, which are responsible for CoV receptor binding, also shared 94.5- 

267 99.8% amino acid identities, suggesting that the viruses have not evolved much yet to 

268 adapt to the corresponding avian host after jumping from one species to another. In 

269 fact, molecular clock analysis estimated that the tMRCA of HouCoV UAE-HKU28 

270 and PiCoV UAE-HKU29 was just around seven years ago and that of FalCoV UAE- 

271 HKU27, HouCoV UAE-HKU28 and PiCoV UAE-HKU29 was just around 38 years 

272 ago (Fig. 5). Nevertheless, it should be noted that results of molecular clock 

273 estimation are only speculative for RNA viruses which are known for episodic 

274 evolution and adaptation to different environments (43). It is interesting to note that 

275 falcons, houbara bustards and pigeons are three radically different types of birds with 

276 unique behaviors and habitats. Falcons (order Falconiformes, family Falconidae ) are 

277 medium-sized birds of prey traditionally used for hunting wild quarry in the Arabian 

278 region. Houbara bustards (order Otidiformes, family Otididae) are large birds that are 

279 geographically restricted to arid habitats. Pigeons (order Columbiformes, family 

280 Columbidae) are relatively smaller and are globally distributed. Yet, these birds are 

281 ecologically closely related because falcons are known predators of pigeons and 

282 houbara bustards, and are also fed with the meat of these birds (44, 45). Moreover, 

283 falcons are trained by Arabian falconers to hunt houbara bustards because Arabs and 
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284 other Asians like Pakistanis believe their meat possesses aphrodisiac qualities and the 

285 meat is sold at high prices, despite banning of such controversial practices in some 

286 countries. Therefore, interspecies transmission of this CoV species is likely a result of 

287 the predator-and-prey relationship. Serological testing confirmed the presence of 

288 specific antibodies against FalCoV UAE-HKU27 in 75% of field falcon sera collected 

289 in Dubai. Such a relatively high seroprevalence is similar to those observed in other 

290 coronaviruses found in other animals, such as MERS-CoV and dromedary CoV UAE- 

291 HKU23 in dromedaries and rabbit CoV HKU14 in rabbits (7, 46), supporting 

292 widespread genuine infection among the falcon population and excluding the 

293 possibility of remnant viruses in falcon fecal samples resulting from ingestion of 

294 pigeons and houbara bustards. The detection of viruses from only one pigeon was not 

295 surprising, as viral shedding is often transient during the acute infection phase. 

296 Further molecular studies will reveal whether these three closely related 

297 deltacoronaviruses share a common receptor in these three phylogenetically distant 

298 birds, where viral adaptation to an evolutionarily conserved host-cell receptor might 

299 help offer facile interspecies transmissibility (47). 

300 Recombination involving the S protein is likely a common phenomenon 

301 among deltacoronaviruses, which may facilitate interspecies transmission and 

302 adaptation to new animal hosts. Phylogenetic analysis showed that the CoV species 

303 comprising FalCoV UAE-HKU27, HouCoV UAE-HKU28 and PiCoV UAE-HKU29 

304 was most closely related to WECoV HKU16 in the 3CL pi °, RdRp, Hel and N genes, 

305 but was only distantly related to WECoV HKU16 and most closely related to MRCoV 

306 E1KU18 in the S gene, suggesting recombination events around the S gene region as 

307 demonstrated by bootscan analysis (Fig. 4). As for QuaCoV UAE-HKU30, PorCoV 

308 HKU15 and SpCoV HKU17, according to the ICTV definition for demarcation of 
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309 CoV species, these three Co Vs should be classified as subspecies of the same CoV 

310 species. However, in contrast to FalCoV UAE-HKU27, HouCoV UAE-HKU28 and 

311 PiCoV UAE-HKU29 which are clustered together in all phylogenetic trees, QuaCoV 

312 UAE-HKU30, PorCoV HKU15 and SpCoV HKU17 showed phylogenetic positions 

313 shifting in different phylogenetic trees. QuaCoV UAE-HKU30 was clustered with 

314 PorCoV HKU15 and SpCoV HKU17 in the 3CL pro , RdRp, Hel and N genes, whereas 

315 it was most closely related to MunCoV-HKU13 in the S gene (Fig. 3). Recombination 

316 around the S gene region was also demonstrated by bootscan analysis (Fig. 4). 

317 Similarly, PorCoV HKU15 and/or Asian leopard cat CoV (ALCCoV) were most 

318 closely related to SpCoV HKU17 in the 3CL pro , RdRp, Hel and N genes, but were 

319 only distantly related to SpCoV HKU17 and most closely related to BuCoV HKU11 

320 in the S gene. This suggests that these mammalian deltacoronaviruses may have 

321 arisen from recombination events between SpCoV HKU17 and BuCoV HKU11 or 

322 related viruses. Recombination is not uncommon in other CoVs, and was found to be 

323 responsible for the emergence of SARS-CoV (48-51), generation of new genotypes or 

324 strains of other Co Vs including human CoV HKU1, human CoV OC43 and feline 

325 CoV type II strains (5, 6, 52). The present results suggest that recombination is also 

326 common among deltacoronaviruses, with the S gene being a frequent recombination 

327 site. Further studies may reveal if this may be an important mechanism for 

328 overcoming the mammalian species barrier through more efficient receptor binding to 

329 swine or other mammalian cells in PorCoV HKU15 and related viruses. 
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330 MATERIALS AND METHODS 

331 Ethical statement. Collection of animal fecal samples and field falcon sera in this 

332 study were approved by the Animal Ethic Committee of Central Veterinary Research 

333 Laboratory and Ministry of Climate Change and Environment, UAE according to the 

334 Ministerial Decree No. 384 of year 2008 on the executive by-law of the Federal Law 

335 No. 16 of year 2007 concerning animal welfare. All the experimental procedures were 

336 performed in accordance with the International Guiding Principles for Biomedical 

337 Research Involving Animals regarding the care and use of animals. 

338 Animal surveillance and sample collection. All animal fecal samples were 

339 left-over specimens submitted to the Central Veterinary Research Laboratory in Dubai, 

340 The United Arab Emirates for pathogen screening over a 24-month period (January 

341 2013 to December 2014). A total of 1,356 fecal samples from 1,164 mammals and 

342 192 birds were tested (Table 1). Serum samples from falcons in the field were 

343 collected in Dubai over a 4-month period (November 2015-February 2016). All 

344 samples were taken from the caudal tibial vein (vena metatarsalis plantaris 

345 superficialis ) under isoflurane-anesthesia in serum tubes, centrifuged and stored at - 

346 20°C until use. 

347 RNA extraction. Viral RNA was extracted from the fecal samples using 

348 RNeasy Mini Kit (Qiagen, Germany). The RNA was eluted in 50 pi of RNase-free 

349 water and was used as the template for RT-PCR. 

350 RT-PCR of RdRp gene of CoVs using deltacoronavirus-conserved 

351 primers and DNA sequencing. Initial CoV screening was performed by amplifying a 

352 440-bp fragment of the RdRp gene of Co Vs using deltacoronavirus-conserved primers 

353 (LPW16472 5'-GTGGVTGTMTTAATGCACAGTC-3' and LPW 16473 5'- 

354 T ACTG Y CT GTTRGTC ATRGTG- 3') we published previously (32). Reverse 
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355 transcription was performed using the Superscript III reverse transcriptase 

356 (Invitrogen, Carlsbad, CA). The PCR mixture (25 pi) contained cDNA, PCR buffer 

357 (10 mM Tris-HCl pH 8.3, 50 mM KC1. 3 mM MgCl 2 and 0.01% gelatin), 200 pM of 

358 each dNTPs and 1.0 U Taq polymerase (Applied Biosystem, Foster City, CA). The 

359 mixtures were amplified in 60 cycles of 94°C for 1 min, 48°C for 1 min and 72°C for 

360 1 min and a final extension at 72°C for 10 min in an automated thermal cycler 

361 (Applied Biosystem). Standard precautions were taken to avoid PCR contamination 

362 and no false-positive was observed in negative controls. 

363 The PCR products were gel-purified using the QIAquick Gel Extraction Kit 

364 (Qiagen). Both strands of the PCR products were sequenced twice with the ABI Prism 

365 31 30x1 Genetic Analyzer (Applied Biosystems), using the two PCR primers. The 

366 sequences of the PCR products were compared with known sequences of the RdRp 

367 genes of CoVs in the DDBJ/ENA/GenBank sequence databases. A phylogenetic tree 

368 was reconstructed using 371-bp fragments of the RdRp gene with the maximum 

369 likelihood method using the substitution model general time reversible with gamma 

370 distributed rate variation as well as estimated proportion of invariable sites 

371 (GTR+G+I) by PhyML.3.0. 

372 Complete genome sequencing. One complete genome each of FalCoV UAE- 

373 HKU27, HouCoV UAE-HKU28 and PiCoV UAE-HKU29 and two complete 

374 genomes of QuaCoV UAE-HKU30 were amplified and sequenced using the RNA 

375 extracted from the original fecal specimens as templates. The RNA was converted to 

376 cDNA by a combined random-priming and oligo(dT) priming strategy. The cDNA 

377 was amplified by degenerate primers designed by multiple alignments of the genomes 

378 of other Co Vs with complete genomes available, using strategies described in our 

379 previous publications (7, 18, 29). Additional primers were designed from the results 
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380 of the first and subsequent rounds of sequencing. The 5' ends of the viral genomes 

381 were confirmed by rapid amplification of cDNA ends using the SMARTer 573’ 

382 RACE kit (Clontech, Mountain View, CA). Sequences were assembled and manually 

383 edited to produce final sequences of the viral genomes. 

384 Genome analysis. The nucleotide sequences of the genomes and the deduced 

385 amino acid sequences of the ORFs were compared to those of other Co Vs using 

386 ORFfinder (https://www.ncbi.nlm.nih.gov/orffinder/). Phylogenetic tree 

387 reconstruction was performed using the maximum likelihood (MF) method with 

388 PhyMF3.0. The best-fit substitution models were selected using PhyMF with Smart 

389 Model Selection. Protein family analysis was performed using PFAM and 

390 InterProScan (53, 54). Prediction of transmembrane domains was performed using 

391 TMHMM (55). 

392 Recombination analysis. To detect possible recombination events, bootscan 

393 analysis was performed by using the nucleotide alignment of the genome sequences of 

394 the novel deltacoronaviruses with Simp lot version 3.5.1, as previously described 

395 (56). The analysis was conducted using a sliding window of 1,000 nucleotides moving 

396 in 200 nucleotide steps with 1,000 bootstrap values. Possible recombination sites 

397 suggested by the bootscan analysis were confirmed through multiple sequence 

398 alignments. 

399 Estimation of divergence dates. Divergence times for the genus 

400 Deltacoronavirus was calculated using a Bayesian Markov Chain Monte Carlo 

401 (MCMC) approach as implemented in BEAST (Version 1.7.4) as described 

402 previously (50, 57). RdRp gene sequence data were selected for analyses under the 

403 substitution model GTR+G+I using an unrelaxed log-normal distributed (Ucld) 

404 relaxed molecular clock and a Bayesian Skyline coalescent model. MCMC run was 5 
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405 x 10 7 steps long, sampling every 1,000 steps. Convergence was assessed on the basis 

406 of the effective sampling size after a 10% bum-in using Tracer version 1.6. tMRCA 

407 and the HPD regions at 95% were calculated. The trees were summarized in a target 

408 tree by the Tree Annotator program included in the BEAST package by choosing the 

409 tree with the maximum sum of posterior probabilities (maximum clade credibility) 

410 after a 10% burn-in. 

411 Cloning and purification of HiSf,-tagged recombinant N protein of FalCoV 

412 UAE-HKU27. Cloning and purification of the His6-tagged recombinant N protein of 

413 FalCoV UAE-HKU27 was performed using a protocol we described previously (46, 

414 58). To produce a plasmid for protein purification, primers LPW36358 (5'- 

415 GGAATTCCAT ATGATGAGC ACTCCC AC AGTCCCT-3') and LPW36359 (5'- 

416 CCGCTCGAGATGCAGTTGAATCTCCATCCTG-3') were used to amplify the gene 

417 encoding the N protein of FalCoV UAE-HKU27 by RT-PCR. The sequence coding 

418 for amino acid residues 1 to 344 of the N protein was amplified and cloned into the 

419 Nde I and Xhol sites of the expression vector pET-28b(+) (Merck, Germany) in-frame 

420 and upstream of the histidine residue series. The recombinant N protein was 

421 expressed and purified using Ni-NTA agarose (Qiagen) according to the 

422 manufacturer’s instructions. 

423 Western blot analysis. Western blot analysis was performed according to our 

424 published protocol (46, 58). Briefly, 600 ng of purified His6-tagged recombinant N 

425 protein of FalCoV UAE-HKU27 was loaded into the well of a sodium dodecyl sulfate 

426 (SDS)-10% polyacrylamide gel and subsequently electrophoresed and then 

427 electroblotted onto a nitrocellulose membrane (Bio-Rad, Hercules, CA). The blot was 

428 cut into strips and each separated strip was incubated with an individual serum sample, 

429 in a dilution of 1:1,000, obtained from falcons in Dubai. Antigen-antibody interaction 
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430 was detected with an in-house developed polyclonal guinea pig anti-falcon IgY 

431 antibody (59) at a dilution of 1:2,000, a horseradish peroxidase (HRP)-conjugated 

432 rabbit anti-guinea pig IgG antibody (Invitrogen) at a dilution of 1:4,000, and the 

433 WestemBright Quantum HRP substrate (Advansta, USA). Monoclonal anti-His6 

434 antibody (clone HIS.H8; Invitrogen) at a concentration of 0.5 p g/mL, with HRP- 

435 conjugated goat anti-mouse IgG antibody (Invitrogen) at a dilution of 1:4,000 as the 

436 secondary antibody, was used as the positive control. 

437 Nucleotide sequence accession numbers. The nucleotide sequences of the 

438 five complete genomes of FalCoV UAE-HKU27, HouCoV UAE-HKU28, PiCoV 

439 UAE-HKU29 and QuaCoV UAE-HKU30 have been lodged into the 

440 DDBJ/ENA/GenBank sequence databases with the accession numbers LC364342- 

441 LC364346. 
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696 LEGENDS TO FIGURES 

697 FIG. 1. Phylogenetic analysis of amino acid sequences of the 371-bp fragment 

698 (excluding primer sequences) of RNA-dependent RNA polymerase (RdRp) of 

699 coronaviruses (CoVs) identified from birds from Dubai in the present study. The tree 

700 was reconstructed by the maximum-likelihood method using PhyML 3.0 with the 

701 substitution model general time reversible with gamma distributed rate variation and 

702 estimated proportion of invariable sites (GTR+G+I). Bootstrap values were calculated 

703 from 1,000 trees. The scale bar indicates the number of nucleotide substitutions per 

704 site. The eight newly identified coronaviruses are shown in bold. List of viruses and 

705 their respective DDBJ/ENA/GenBank accession numbers are as follow: ALCCoV, 

706 Asian leopard cat coronavirus (EF584908); Badger SARS-CoV, SARS-related 

707 Chinese ferret badger CoV (AY545919); BCoV, bovine CoV (NC_003045); BdCoV 

708 HKU22, bottlenose dolphin CoV HKU22 (KE793826); BuCoV HKU11, bulbul CoV 

709 HKU11 (FJ376619); BWCoV SW1, Beluga whale CoV SW1 (NCJH0646); Camel 

710 MER-CoV, Camel Middle East respiratory syndrome CoV (KT751244); ChRCoV 

711 HKU24, China Rattus CoV HKU24 (KM349742); Civet SARS-CoV, SARS-related 

712 palm civet CoV (AY304488); CMCoV HKU21, common-moorhen CoV HKU21 

713 (NC_016996); DcCoV HKU23, dromedary camel CoV HKU23 (KE906251); FalCoV 

714 UAE-HKU27, falcon CoV UAE-HKU27; FIPV, feline infectious peritonitis virus 

715 (AY994055); GiCoV, giraffe CoV (EF424622); HouCoV UAE-HKU28, houbara 

716 CoV UAE-HKU28; HCoV 229E, human CoV 229E (NC_002645); HCoV HKU1, 

717 human CoV HKU1 (NC_006577); HCoV NL63, human CoV NL63 (NC_005831); 

718 HCoV OC43, human CoV OC43 (NC_005147); Human MERS-CoV, human Middle 

719 East respiratory syndrome CoV (JX869059); Human SARS-CoV, severe acute 

720 respiratory syndrome-related human CoV (NC_004718); IBV, infectious bronchitis 
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721 virus (NC_001451); IBV-partridge, partridge coronavirus (AY646283); IBV-peafowl. 

722 peafowl coronavirus (AY641576); MHV, murine hepatitis virus (NC_001846); 

723 MRCoV HKU18, magpie robin CoV HKU18 (NC_016993); MunCoV HKU13, 

724 munia CoV HKU13 (FJ376622); NHCoV HKU19, night-heron CoV HKU19 

725 (NC_016994); PEDV, porcine epidemic diarrhea virus (NC_003436); PHEV, porcine 

726 hemagglutinating encephalomyelitis virus (NC_007732); PiCoV UAE-HKU29, 

727 pigeon CoV UAE-HKU29; Pi-BatCoV HKU5, Pipistrellus bat CoV HKU5 

728 (NC_009020); PorCoV HKU15, porcine CoV HKU15 (NC_016990); PRCV, porcine 

729 respiratory CoV (DQ811787); QuaCoV UAE-HKU30, quail CoV UAE-HKU30; 

730 RbCoV HKU14, rabbit CoV HKU14 (JN874559); Rh-BatCoV HKU2, Rhinolophus 

731 bat CoV HKU2 (EF203064); Ro-BatCoV HKU9, Rousettus bat CoV HKU9 

732 (NC_009021); SACoV, sable antelope CoV (EF424621); SARSr-Rs-BatCoV HKU3, 

733 SARS-related Rhinolophus bat CoV HKU3 (DQ022305); Sc-BatCoV 512, 

734 Scotophilus bat CoV 512 (NC_009657); SpCoV HKU17, sparrow CoV HKU17 

735 (NC_016992); TGEV, transmissible gastroenteritis virus (NC_002306); ThCoV 

736 HKU12, thrush CoV HKU12 (FJ376621); Ty-BatCoV HKU4, Tylonycteris bat CoV 

737 HKU4 (NC_009019); WECoV HKU16, white-eye CoV HKU16 (NC_016991); 

738 WiCoV HKU20, wigeon CoV HKU20 (NC_016995). 

739 FIG. 2. Genome organization of members of Deltacoronavirus. Open reading frames 

740 downstream of spike (S) gene are magnified to show the differences among the 

741 genomes of the 10 CoVs. Papain-like protease (PL pi °), chymotrypsin-like protease 

742 (3CL pi °), and RNA-dependent RNA polymerase (RdRp) genes are represented by 

743 green boxes. S, envelope (E), membrane (M) and nucleocapsid (N) genes are 

744 represented by orange boxes. Putative accessory proteins are represented by blue 
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745 boxes. The novel coronaviruses discovered in this study are shown in bold. 

746 Abbreviations for the viruses are the same as those in Fig. 1. 

747 FIG. 3. Phylogenetic analyses of chymotrypsin-like protease (3CL pro ), RNA- 

748 dependent RNA polymerase (RdRp), helicase (Hel), spike (S) protein and 

749 nucleocapsid (N) protein of Falcon CoV-FIKU27, Houbara CoV-FlKU28, Pigeon 

750 CoV-HKU29 and Quail CoV-HKU30. The trees were reconstructed by the maximum- 

751 likelihood method using PhyML 3.0 with the substitution models Le and Gascuel (LG) 

752 with gamma distributed rate variation (G) (3CL pi °); LG with G, estimated proportion 

753 of invariable sites (I) and empirical frequencies (F) (RdRp); LG+G+F (Hel and N); 

754 and Whelan and Goldman (WAG)-tG-tI-tF (S) with bootstrap values calculated from 

755 1,000 trees. 314, 944, 599, 1561 and 392 amino acid positions in 3CL pro , RdRp, Hel. 

756 S and N, respectively, were included in the analyses. The scale bars indicate the 

757 number of amino acid substitutions per site. Viruses characterized in this study are in 

758 bold. Abbreviations for the viruses are the same as those in Fig. 1. 

759 FIG. 4. Detection of possible recombination by bootscan analysis. Bootscanning was 

760 conducted with Simplot version 3.5.1 (F84 model; window size, 1,000 bp; step, 

761 200 bp). (A) Falcon CoV UAE-HKU27 (FalCoV UAE-HKU27) was used as the 

762 query sequence and compared with the genome sequences of white-eye coronavirus 

763 HKU16 (WECoV HKU16), magpie robin coronavirus HKU18 (MRCoV HKU18) and 

764 ThCoV HKU12 thrush coronavirus HKU12 (ThCoV HKU12). (B) Quail CoV UAE- 

765 HKU30 (QuaCoV UAE-HKU30) was used as the query sequence and compared with 

766 the genome sequences of sparrow coronavirus HKU17 (SpCoV HKU17), munia 

767 coronavirus HKU13 (MunCoV HKU13) and ThCoV HKU12. 

768 FIG. 5. Estimation of the mean time to the most recent common ancestor (tMRCA) 

769 for Deltacoronavirus. The time-scaled phylogeny was summarized from all Markov 
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770 Chain Monte Carlo (MCMC) phylogenies of the RNA-dependent RNA polymerase 

771 (RdRp) gene data set analyzed under the relaxed-clock model with an uncorrelated 

772 log-normal distribution in BEAST version 1.7.4. Viruses characterized in this study 

773 are in bold. Abbreviations for the viruses are the same as those in Fig. 1. 

774 FIG. 6. Western blot analysis of falcon CoV UAE-HKU27 (FalCoV UAE-HKU27) 

775 using nucleocapsid (N) protein expressed in Escherichia coli. Fane 1, purified 

776 recombinant FalCoV UAE-F1KU27 N protein; Fane 2, falcon serum sample (FS7) 

777 strongly positive for antibody against FalCoV UAE-HKU27 N protein; Fane 3: falcon 

778 serum sample (FS5) negative for antibody against FalCoV UAE-F1KU27 N protein. 

779 

780 
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781 Table 1. Animals screened and their associated coronaviruses (CoVs) in the present 

782 study 


Animals 

No. of 

specimens 

tested 

No. (%) of 
specimens 
positive for CoV 

CoV 

Birds 

192 

8 (4.16%) 


Black swan 

1 

0 

- 

Crowned crane 

1 

0 

- 

Eclectus parrot 

1 

0 

- 

Falcon 

34 

1 (2.94%) 

Falcon CoV UAE-HKU27 (n=l) 

Flamingo 

3 

0 

- 

Grey parrot 

1 

0 

- 

Heuglin’s bustard 

10 

0 

- 

Houbara bustard 

36 

1 (2.77%) 

Houbara CoV UAE-HKU28 (n=l) 

Kori bustard 

4 

0 

- 

Myna 

1 

0 

- 

Ostrich 

4 

0 

- 

Peacock 

1 

0 

- 

Pigeon 

18 

1 (7.14%) 

Pigeon CoV UAE-HKU29 (n=l) 

Rhea 

1 

0 

- 

Thick-knee 

1 

0 

- 

Stone curlew 

22 

0 

- 

Partridge 

8 

0 

- 

Chicken 

19 

0 

- 

Duck 

2 

0 

- 

Guineafowl 

6 

0 

- 

Pheasant 

7 

0 

- 

Quail 

10 

5 (50%) 

Quail CoV UAE-HKU30 (n=5) 

Sand grouse 

1 

0 

- 

Mammals 

1,164 

0 


Antelope 

90 

0 

- 

Cat 

59 

0 

- 

Cattle 

3 

0 

- 

Camel 

754 

0 

- 

Dog 

145 

0 

- 

Goat 

34 

0 

- 

Horse 

44 

0 

- 

Lion 

7 

0 

- 

Monkey 

6 

0 

- 

Rabbit 

9 

0 

- 

Rodent 

13 

0 

- 
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784 Table 2. Comparison of genomic features and amino acid identities between the four novel deltacoronaviruses, representative members 

785 of alpha-, beta- and gammacoronaviruses and other deltacoronaviruses 

786 

787 CoV Genome size (base) G+C content Amino acid identity (%) 

FalCoV UAE-HKU27_HouCoV UAE-HKU28_PiCoV UAE-HKU29_QuaCoV UAE-HKU30 





3CL pro 

RdRp Hel 

S 

N 

3CL pro 

RdRp Hel 

S 

N 

3CL pro 

RdRp 

Hel 

S 

N 

3CL pro 

RdRp Hel 

S 

N 

Alphacoronavirus 

HCoV 229E 

27,317 

0.38 

35.8 

49.1 

47.7 

44.8 

21.4 

35.5 

49.0 

47.7 

44.8 

21.4 

35.8 

49.1 

47.7 

44.8 

21.4 

34.8 

48.9 

50.6 

41.1 

22.3 

HCoV NL63 

27,553 

0.34 

36.7 

48.9 

47.5 

39.3 

19.8 

36.7 

48.8 

47.5 

39.5 

20.5 

37.0 

48.9 

47.5 

39.4 

20.5 

36.0 

48.8 

49.6 

37.7 

21.5 

Betcicoroncivirus 

Lineage A 

PCoV 

31,028 

0.37 

36.5 

51.9 

47.2 

24.9 

22.7 

36.5 

52.0 

47.2 

24.9 

22.5 

38.1 

52.1 

47.2 

25.0 

22.5 

37.2 

51.4 

48.6 

25.4 

23.4 

HCoV HKU1 

29,926 

0.32 

38.1 

51.6 

47.2 

25.5 

20.1 

37.8 

51.6 

47.2 

25.8 

20.1 

36.9 

51.7 

47.2 

25.8 

20.1 

38.1 

51.2 

48.3 

25.0 

21.3 

Lineage B 

SARS-CoV 

29,751 

0.41 

35.8 

50.2 

50.2 

25.9 

24.2 

35.5 

50.2 

50.0 

24.5 

24.0 

35.8 

50.3 

50.0 

24.5 

24.0 

34.8 

51.1 

51.9 

25.0 

25.3 

Lineage C 

MERS-CoV 

30,119 

0.41 

36.7 

50.7 

48.9 

25.8 

24.2 

36.4 

50.7 

48.9 

25.0 

23.5 

36.7 

50.8 

48.9 

25.0 

23.5 

35.8 

51.4 

50.2 

25.3 

23.1 

Ty-BatCoV HKU4 

30,286 

0.38 

36.0 

51.2 

48.9 

25.1 

24.2 

36.3 

51.2 

48.9 

25.3 

24.1 

36.6 

51.3 

48.9 

25.3 

24.1 

35.7 

50.6 

49.7 

24.0 

23.8 

Lineage D 

Ro-Bat-CoV HKU9 

29,114 

0.41 

38.0 

52.5 

49.2 

26.7 

24.3 

38.0 

52.4 

49.0 

26.6 

24.5 

38.3 

52.5 

49.0 

26.6 

24.5 

37.4 

51.7 

51.1 

26.9 

24.2 

Gammacoronavirus 

IBV 

27,608 

0.38 

42.9 

54.8 

53.4 

27.9 

28.2 

42.9 

54.9 

53.4 

28.5 

28.4 

43.3 

55.0 

53.4 

28.6 

28.4 

44.6 

54.6 

56.1 

29.5 

29.0 

Deltacoronavirus 

BuCoV HKU11 

26,476 

0.39 

85.0 

88.8 

94.5 

45.2 

71.5 

84.4 

89.1 

94.7 

45.0 

72.1 

85.0 

88.9 

94.7 

45.2 

72.1 

81.1 

87.5 

89.4 

68.1 

73.4 

ThCoV HKU12 

26,396 

0.38 

84.7 

86.9 

94.5 

46.3 

78.6 

84.7 

87.0 

94.4 

47.0 

79.1 

85.3 

86.8 

94.4 

46.8 

79.1 

82.4 

87.2 

89.9 

47.9 

79.9 

MunCoV HKU13 

26,552 

0.43 

77.9 

87.6 

88.3 

46.2 

74.1 

77.5 

87.6 

88.1 

46.5 

74.4 

77.9 

87.4 

88.1 

46.4 

74.4 

84.0 

89.9 

96.1 

73.0 

77.4 

PorCoV HKU15 

25,421 

0.43 

80.8 

87.4 

88.1 

45.7 

75.4 

80.1 

87.5 

88.0 

46.0 

75.7 

80.8 

87.3 

88.0 

45.9 

75.7 

91.2 

95.2 

98.5 

71.8 

90.9 

WECoV HKU16 

26,027 

0.40 

86.6 

91.9 

96.8 

47.1 

82.7 

86.3 

91.9 

97.0 

47.7 

83.6 

87.0 

91.7 

97.0 

47.8 

83.6 

78.5 

86.7 

88.4 

62.2 

75.5 

SpCoV HKU17 

26,067 

0.45 

80.8 

87.5 

88.3 

67.3 

75.9 

80.1 

87.6 

88.1 

67.9 

76.8 

80.8 

87.4 

88.1 

67.7 

76.8 

93.8 

95.8 

98.7 

44.7 

91.8 

MRCoV HKU18 

26,674 

0.47 

78.8 

87.1 

88.3 

68.6 

73.4 

78.5 

87.1 

88.1 

68.2 

73.6 

78.8 

86.9 

88.1 

68.2 

73.6 

85.7 

90.3 

96.3 

45.0 

78.2 

NHCoV HKU19 

26,064 

0.38 

58.3 

72.3 

75.9 

45.6 

50.4 

57.6 

72.3 

76.2 

46.0 

50.7 

57.9 

72.3 

76.2 

46.2 

50.7 

53.7 

72.3 

78.4 

41.8 

51.6 

WiCoV HKU20 

26,211 

0.39 

58.3 

72.1 

73.8 

45.6 

49.3 

57.7 

72.0 

73.8 

45.7 

50.8 

58.3 

71.9 

73.8 

45.7 

50.8 

59.0 

71.3 

75.4 

43.0 

51.4 

CMCoV HKU21 

26,212 

0.35 

76.2 

84.8 

89.1 

47.1 

64.7 

75.9 

84.7 

89.2 

46.5 

65.3 

76.5 

84.5 

89.2 

46.5 

65.3 

71.7 

83.5 

84.6 

51.5 

61.6 

FalCoV UAE-HKU27 

26,155 

0.39 






99.3 

98.9 

99.5 

94.5 

100 

99.3 

98.7 

99.5 

94.5 

99.1 

81.8 

86.3 

88.3 

45.8 

74.2 

HouCoV UAE-HKU28 

26,155 

0.39 

98.7 

98.9 

99.5 

94.5 

99.1 






99.3 

99.8 

100 

99.8 

100 

81.1 

86.4 

88.1 

46.0 

74.8 

PiCoV UAE-HKU29 

26,162 

0.39 

99.3 

98.7 

99.5 

94.5 

99.1 

98.7 

99.8 

100 

99.8 

99.1 






81.8 

86.2 

88.1 

46.1 

74.8 

QuaCoV UAE-HKU30 

25,871 

0.42 

80.8 

86.3 

88.3 

45.8 

74.8 

80.1 

86.4 

88.1 

46.0 

74.8 

80.8 

86.2 

88.1 

46.1 

74.8 
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788 Table 3. Coding potential and putative transcription regulatory sequences of novel 

789 deltacoronavirus genomes 

790 


CoVs ORF 

Location 

Frame 

Length (aa) 

Length (nt) 

TRS location 

TRS sequence 
(distance in bases to AUG) 

FalCoV UAE-HKU27 lab 

595-19,271 

+1, +3 

6226 

18,678 

71 

ACACCA (523) AUG 

S 

19,253-22,855 

+2 

1201 

3,603 

19112 

AC ACC A (140) AUG 

E 

22,849-23,097 

+1 

83 

249 

22828 

ACACCU (20) AUG 

M 

23,090-23,746 

+2 

219 

657 

23072 

ACACCA (17) AUG 

NS6 

23,746-24,027 

+1 

94 

282 

23726 

ACGCCA (19) AUG 

N 

24,051-25,085 

+3 

345 

1,035 

24042 

ACACCA (8) AUG 

NS7a 

24,079-24,735 

+1 

219 

657 

24042 

ACACCA (36) AUG 

NS7b 

25,066-25,272 

+1 

69 

207 

25042 

ACAACG (18) AUG 

NS7c 

25,259-25,636 

+2 

126 

378 

25244 

AGACCU (14) AUG 

NS7d 

25,629-25,826 

+3 

66 

198 

25618 

AUACCA (10) AUG 

HouCoV UAE-HKU28 lab 

588-19,264 

+3,+2 

6226 

18,678 

70 

ACACCA (517) AUG 

S 

19,246-22,848 

+1 

1201 

3,603 

19105 

ACACCA (140) AUG 

E 

22,842-23,090 

+3 

83 

249 

22821 

ACACCU (20) AUG 

M 

23,083-23,739 

+1 

219 

657 

23065 

ACACCA (17) AUG 

NS6 

23,739-24,020 

+3 

94 

282 

23719 

ACGCCA (19) AUG 

N 

24,044-25,078 

+2 

345 

1,035 

24035 

ACACCA (8) AUG 

NS7a 

24,072-24,728 

+3 

219 

657 

24035 

ACACCA (36) AUG 

Ns7b 

25,059-25,265 

+3 

69 

207 

25040 

ACAACG (18) AUG 

NS7c 

25,252-25,629 

+1 

126 

378 

25237 

AGACCU (14) AUG 

Ns7d 

25,622-25,819 

+2 

66 

198 

25611 

AUACCA (10) AUG 

PiCoV UAE-HKU29 lab 

588-19,264 

+3,+2 

6226 

18,678 

70 

ACACCA (517) AUG 

S 

19,246-22,848 

+1 

1201 

3,603 

19105 

ACACCA (140) AUG 

E 

22,842-23,090 

+3 

83 

249 

22821 

ACACCU (20) AUG 

M 

23,083-23,739 

+1 

219 

657 

23065 

ACACCA (17) AUG 

NS6 

23,739-24,020 

+3 

94 

282 

23719 

ACGCCA (19) AUG 

N 

24,044-25,078 

+2 

345 

1,035 

24035 

ACACCA (8) AUG 

NS7a 

24,072-24,728 

+3 

219 

657 

24035 

ACACCA (36) AUG 

Ns7b 

25,059-25,265 

+3 

69 

207 

25040 

ACAACG (18) AUG 

NS7c 

25,252-25,629 

+1 

126 

378 

25237 

AGACCU (14) AUG 

Ns7d 

25,622-25,819 

+2 

66 

198 

25611 

AUACCA (10) AUG 

QuaCoV UAE-HKU30 lab 

522-19,312 

+3,+2 

6121 

18,363 

62 

ACACCA (459) AUG 

S 

19,294-22,770 

+1 

1159 

3,477 

19153 

ACACCA (140) AUG 

E 

22,764-23,015 

+3 

84 

252 

22740 

CAACCA (23) AUG 

M 

23,008-23,661 

+1 

218 

654 

22987 

ACACCA (20) AUG 

NS6 

23,661-23,942 

+3 

94 

282 

23614 

ACACCA (46) AUG 

N 

23,967-24,992 

+3 

342 

1,026 

23959 

ACACCA (7) AUG 

NS7a 

24,061-24,663 

+1 

201 

603 

23978 

GCTCCA (82) AUG 

NS7b 

25,003-25,419 

+1 

139 

417 

24998 

ACACCA (4) AUG 

NS7c 

25,358-25,579 

+2 

74 

222 

25347 

ACACCA (10) AUG 
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791 Table 4. Putative cleavage sites at the junctions between non-structural proteins in FalCoV UAE-HKU27, HouCoV UAE-HKU28, PigCoV 

792 UAE-HKU29 and QuaCoV UAE-HKU30 compared with other deltacoronaviruses 

793 


nsp 

PorCoV 

HKU15 

WECoV 

HKU16 

SpCoV 

HKU17 

MRCoV 

HKU18 

NHCoV 

HKU19 

WiCoV 

HKU20 

CMCoV 

HKU21 

FalCoV 

UAE- 

HKU27 

HouCoV 

UAE- 

HKU28 

PigCoV 

UAE- 

HKU29 

QuaCoV 

UAE- 

HKU30 

nsp2/nsp3 

AG/SD 

AG/SD 

AG/SD 

AG/AD 

VG/GL 

DG/VY 

AG/VS 

AG/SD 

AG/SD 

AG/SD 

AG/SD 

nsp3/nsp4 

AG/AP 

AG/AR 

AG/AP 

AG/AM 

TG/GN 

GG/SK 

AG/KF 

AG/RK 

AG/RK 

AG/RK 

AG/AP 

nsp4/nsp5 

LQ/AG 

LQ/AG 

LQ/AG 

LQ/AG 

VQ/AG 

VQ/SG 

VQ/AG 

LQ/AG 

LQ/AG 

LQ/AG 

LQ/AG 

nsp5/nsp6 

LQ/SG 

LQ/SN 

LQ/SG 

LQ/SG 

LQ/GT 

LQ/AN 

LQ/AS 

LQ/SG 

LQ/SG 

LQ/SG 

LQ/SG 

nsp6/nsp7 

VQ/NK 

VQ/NK 

VQ/NK 

VQ/NK 

VQ/NK 

VQ/NR 

VQ/NR 

VQ/NR 

VQ/NR 

VQ/NR 

VQ/NK 

nsp7/nsp8 

VQ/AV 

VQ/AV 

VQ/AV 

VQ/AV 

LQ/VV 

LQ/VV 

LQ/VV 

LQ/AV 

LQ/AV 

LQ/AV 

VQ/AV 

nsp8/nsp9 

LQ/NN 

LQ/NN 

LQ/NN 

LQ/NN 

LQ/NN 

CQ/NN 

LQ/NN 

LQ/NN 

LQ/NN 

LQ/NN 

LQ/NN 

nsp9/nspl0 

LQ/AS 

LQ/AN 

LQ/AN 

LQ/AN 

LQ/SS 

LQ/AN 

LQ/AT 

LQ/AN 

LQ/AN 

LQ/AN 

LQ/AN 

nsplO/nspl 1 

LQ/NS 

LQ/GS 

LQ/NS 

LQ/NS 

LQ/LG 

LQ/SN 

LQ/NT 

LQ/GS 

LQ/GS 

LQ/GS 

LQ/NS 

nspl2/nspl3 

LQ/AS 

LQ/AS 

LQ/AS 

LQ/AS 

LQ/AT 

LQ/AT 

LQ/AS 

LQ/AS 

LQ/AS 

LQ/AS 

LQ/AS 

nspl3/nspl4 

LQ/SS 

LQ/SS 

LQ/SS 

LQ/AG 

VQ/SL 

VQ/AE 

VQ/CS 

LQ/SS 

LQ/SS 

LQ/SS 

LQ/SG 

nspl4/nspl5 

LQ/NL 

LQ/NL 

LQ/NL 

LQ/NL 

LQ/TL 

LQ/TL 

LQ/TI 

LQ/NL 

LQ/NL 

LQ/NL 

LQ/NL 

nsp!5/nspl6 

LQ/SL 

VQ/SL 

LQ/SL 

LQ/SL 

VQ/AL 

LQ/SL 

VQ/SL 

VQ/AL 

VQ/AL 

VQ/AL 

LQ/SL 
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795 Table 5A. Comparison of amino acid identities (%) of the seven conserved 

796 replicase domains for species demarcation among FalCoV UAE-HKU27, HouCoV 

797 UAE-HKU28, PiCoV UAE-HKU29, and WEC 0 V-HKUI 6 

798 


CoV 

Domains 

WECoV 

HKU16 

FalCoV 

UAE-HKU27 

HouCoV 

UAE-HKU28 

PiCoV 

UAE-HKU29 

WECoV 

ADRP 

- 

68.4 

68.7 

68.8 

HKU16 

3CL pro 

- 

86.6 

86.3 

87.0 


RdRp 

- 

91.9 

91.9 

91.7 


Hel 

- 

96.8 

97.0 

97.0 


ExoN 

- 

93.6 

93.8 

93.8 


NendoU 

- 

86.2 

85.6 

85.6 


O-MT 

- 

89.2 

89.2 

88.9 


Concatenated 

- 

84.4 

84.4 

84.4 

FalCoV 

ADRP 

- 

- 

98.6 

98.6 

UAE-HKU27 

3CL pro 

- 

- 

98.7 

98.7 


RdRp 

- 

- 

98.9 

98.7 


Hel 

- 

- 

99.5 

99.5 


ExoN 

- 

- 

99.8 

99.8 


NendoU 

- 

- 

99.1 

99.1 


O-MT 

- 

- 

100 

99.6 


Concatenated 

- 

- 

99.1 

99.1 

HouCoV 

ADRP 

- 

- 

- 

99.9 

UAE-HKU28 

3CL pro 

- 

- 

- 

99.3 


RdRp 

- 

- 

- 

99.8 


Hel 

- 

- 

- 

100 


ExoN 

- 

- 

- 

100 


NendoU 

- 

- 

- 

100 


O-MT 

- 

- 

- 

99.6 


Concatenated 

- 

- 

- 

99.9 


799 


Downloaded from http://jvi.asm.org/on May 18, 2018 by Kaohsiung Medical University 





Journal of Virology JVI Journal of Virology Accepted ManUSCrTpFPoSted Ollfme 


800 Table 5B. Comparison of amino acid identities (%) of the seven conserved 

801 replicase domains for species demarcation among QuaCoV UAE-HKU30, PorCoV- 

802 HKU15, and SpCoV-HKU17 

803 


CoV 

Domains 

PorCoV 

HKU15 

SpCoV 

HKU17 

QuaCoV 

UAE-HKU30 

PorCoV HKU15 

ADRP 

- 

89.3 

83.0 


3CL pro 

- 

97.1 

91.2 


RdRp 

- 

97.8 

95.2 


Hel 

- 

99.2 

98.5 


ExoN 

- 

97.3 

96.1 


NendoU 

- 

96.3 

91.1 


O-MT 

- 

97.5 

96.4 


Concatenated 

- 

95.0 

91.3 

SpCoV HKU17 

ADRP 

- 

- 

83.0 


3CL pro 

- 

- 

93.8 


RdRp 

- 

- 

95.8 


Hel 

- 

- 

98.7 


ExoN 

- 

- 

97.1 


NendoU 

- 

- 

92.0 


O-MT 

- 

- 

95.7 


Concatenated 

- 

- 

91.8 


804 
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